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POLYNOMIAL CONSTRAINT SATISFACTION PROBLEMS, GRAPH 
BISECTION, AND THE ISING PARTITION FUNCTION 

ALEXANDER D. SCOTT* AND GREGORY B. SORKIN 

Abstract. We introduce a problem class we call Polynomial Constraint Satisfaction Problems, 
or PCSP. Where the usual CSPs from computer science and optimization have real-valued score 
functions, and partition functions from physics have monomials, PCSP has scores that are arbitrary 
multivariate formal polynomials, or indeed take values in an arbitrary ring. 

Although PCSP is much more general than CSP, remarkably, all (exact, exponential-time) al- 
gorithms we know of for 2-CSP (where each score depends on at most 2 variables) extend to 
2-PCSP, at the expense of just a polynomial factor in running time for adding and multiplying 
polynomials rather than reals. Specifically, four algorithms that extend are: the reduction-based 
algorithm of Scott and Sorkin; the specialization of that approach to sparse random instances, 
where the algorithm runs in polynomial expected time; dynamic-programming algorithms based on 
tree decompositions; and the split-and-list matrix-multiplication algorithm of Williams. 

This gives the first polynomial-space exact algorithm more efficient than exhaustive enumera- 
tion for the well-studied problems of finding a minimum bisection of a graph, and calculating the 
partition function of an Ising model. Furthermore, PCSP solves both optimization and counting 
versions of a wide range of problems, including all CSPs, and thus enables samplers including 
uniform sampling of optimal solutions and Gibbs sampling of all solutions. 

i o i 

CO ■ 1. Introduction 

>. 

We introduce a problem class we call Polynomial 2-CSP, which generalizes more common 2-CSP 

constraint satisfaction problems, yet is solvable by adaptations of all the best CSP algorithms we 
know of, with essentially equal efficiency This provides the first efficient polynomial-space exact 
algorithm for the well-studied problem of finding the maximum bisection of a graph (exponential- 
space algorithms were previously known), and the first exact algorithm of any sort, other than naive 
exhaustive enumeration, for the equally important problem of computing the partition function of 
an Ising model. It also solves a breadth of other optimization problems beyond the class 2-CSP, 
including some quite convoluted ones, in a simple, uniform framework. It allows counting and 
sampling as well as optimization, and again, does this in time bounds matching those for 2-CSP. 
/S \ Let us elaborate. 

Many decision and optimization problems, such as 3-colorability, Max Cut, Max fe-Cut, and Max 
2-Sat, belong to the class Max 2-CSP of constraint satisfaction problems with at most two variables 
per constraint. A Max 2-CSP (or for short, simply CSP) instance defines a "score function" or "soft 
constraint" on each vertex and edge of a "constraint graph" , and its solution is a vertex coloring 
or "assignment" maximizing the total score. 

We define a more general class we call Polynomial 2-CSP, or simply 2-PCSP or PCSP. (The PCSP 
definition extends obviously to longer clauses, but much less is known about algorithms even for 
3-CSP.) Comparing with a CSP, the vertex and edge scores of a PCSP are (potentially multivariate) 
polynomials (rather than real numbers), the score of a coloring is the product of all vertex and 
edge scores (rather than their sum), and an instance's "value", or "partition function", is the sum 
of scores over all colorings (rather than the maximum). In fact the scores may be "generalized 
polynomials", whose powers are not restricted to positive integers but may be arbitrary reals, but 
we use "polynomial" for want of a better term. 
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Compared with the "max-sum" CSP formulation typical in the CSP community, PCSP has 
a "sum-product" form commonly seen as the "partition function" in statistical physics: a sum 
over configurations, of a product of exponentials of "energies". In the physics settings we are 
familiar with, the partition function is always a product of monomials (sometimes multivariate, for 
example with energy and magnetization), whereas our PCSPs permit polynomials. With general 
polynomials, PCSP is closed under a variety of operations enabling its (relatively) efficient solution 
by a number of algorithmic approaches. 

PCSP includes many optimization problems not in the class CSP, including Maximum Bisection, 
Maximum Clique, Sparsest Cut, various judicious partitioning problems, Max Ones 2-Sat, and many 
others. Capitalizing on the fact that a PCSP's score functions may be multivariate polynomials, 
in all these cases, we simply track several score functions at once. For example, for Max Bisection 
we track the size of one partition as well as the size of the cut. 

The i^-coloring of a graph G has received much recent attention, including the book [HN04J , the 
application to soft constraints in [BG 05], and the application to asymmetric hard constraints in 
[DGP06J. //-coloring can be encoded as a PCSP by giving every edge of G the same (symmetric) 
score function. However, PCSP can also give a different (and asymmetric) score function to each 
edge, and indeed this is an essential feature as it leaves the class closed under various reductions. 

PCSPs give a powerful technique for counting and sampling solutions. Given a CSP, there is 
a natural way to obtain a corresponding PCSP. Solving the PCSP gives a generating function for 
the number of CSP assignments of each possible score, which allows us to count or (by succes- 
sively solving subinstances) to sample uniformly from solutions of maximum score, or to sample all 
solutions according to the Gibbs distribution or any other score-based distribution. 

What makes PCSP so interesting is that, despite being more general than CSP, all the best 
CSP algorithms we know of can be extended to solve PCSP, with the same running-time bounds. 
(Exact CSP algorithms are generally exponential-time, and solving a PCSP takes just a polynomial 
factor longer than solving a CSP instance of the same size, for the extra overhead of adding and 
multiplying polynomials rather than reals.) Specifically, four algorithms that extend are: the 
split-and-list matrix-multiplication algorithm of Williams [Wil04j; the reduction-based algorithm 
of Scott and Sorkin [SS07]; the specialization of that approach to sparse random instances, where 
the algorithm runs in polynomial expected time jSS06bj : and dynamic-programming algorithms 
based on tree decompositions [JKLS051 ISS071 IKMRR09] . 

Both the definition of PCSP and all our algorithms only require us to take sums and products 
of polynomial "scores" . It is thus natural to work in a still more general class that we call Ring 
CSP, where each score function and the "value" of an instance are given by element of an arbitrary 
ring R; PCSPs are a special case where R is a polynomial ring. 



2. Outline 

Following a small amount of notation in Section [3l Section H] defines the classes CSP, PCSP and 
RCSP. Section [5] provides examples of CSPs and related PCSPs; because this includes many prob- 
lems beyond CSP, such as graph bisection, the Ising partition function, and judicious partitioning, 
it is a focal point of the paper. 

In Sections EHSl we show how four CSP algorithms extend to PCSP/RCSP, and analyze them in 
terms of the number of ring operations (or sums and products of polynomials in the case of PCSPs) 
that they require. In Section [10] we consider the time and space complexity of the ring operations 
for PCSPs. We put the two together in Section [TTT summarizing the time and space complexity of 
each algorithm, and noting the circumstances favoring one algorithm over another. 

In addition to counting maximum CSP solutions, a PCSP's partition function also enables con- 
struction of an optimal solution, and various forms of perfect random sampling from optimal solu- 
tions or all solutions; this is taken up in Section [i"2l 
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3. Notation 

In the next section we will define the class of CSPs and our new class PCSP of Polynomial CSPs. 
An instance of either has a "constraint graph" G = (V, E) with vertex set V and edge set E, and 
we reserve the symbols G, V and E for these roles. An instance of CSP or PCSP also has a domain 
of values or colors that may be assigned to the vertices (variables), for example {true, false} for 
a satisfiability problem, or a set of colors for a graph coloring problem. In general we will denote 
this domain by [k], interpreted as {0, . . . , k — 1} or (it makes no difference) {1, . . . , k}. At the heart 
of both the CSP and PCSP instance will be cost or "score" terms s v (i) (v £ V(G), i £ [k]) and 
s X y(i,j) (x,y £ V(G), i,j £ [k]). 

For CSPs the scores s are real numbers, while for PCSPs they are polynomials which we write in 
the variable z if univariate, or if multivariate over z, w or z, w\,W2, ■ ■ ■ ■ Just as CSP scores s need 
not be positive integers, exponents in the PCSP score "polynomials" may be fractional or negative 
(or both). (The term "fractional polynomial" is used in the literature of statistical regression.) 

We use l±l to indicate disjoint union, so Vo ttl V\ = V means that Vo and V\ partition V , i.e., 
Vo (~l V\ = and Vo U V\ = V. The notation O* hides polynomial factors in any parameters, so for 
example 0( n 2 5+19m / 100 ) is contained in O*(2 19m / 10 °). 

4. CSP, Polynomial CSP, and Ring CSP 

4.1. CSP. Let us begin by defining the problem class CSP over a domain of size k. An instance I 
of CSP with constraint graph G = (V, E) and domain [k] has the following ingredients: 

(1) a real number s$; 

(2) for each vertex v £ V and color i £ [k], a, real number s v (i); 

(3) for each edge xy £ E and any colors i, j £ [k], a real number s xy (i,j). 

We shall refer to these quantities as, respectively, the nullary score, the vertex scores, and the edge 
scores. Note that we want only one score for an edge {i,j} with given colors {x,y} assigned its 
respective endpoints, so s xy (i,j) and s yx (j,i) are taken to be equivalent names for the same score 
(or one may simply assume that x < y). 

Given an assignment (or coloring) a : V — ► [k], we define the score of a to be the real number 

1(a) := s + Y^ s v (a(v)) + ^ s xy (a(x),a(y)). 

veV xyeE 

The instance's solution is the maximum possible score 

(!) max 1(a) = max (s + Y] s v (a(v)) + Y] s xy (a(x),a(y))) 

L J l ' v£V xy£E 

or the assignment a achieving the maximum; it is straightforward to get either from the other. For 
obvious reasons, ([1]) is commonly referred to as a max-sum formulation. 

In this paper we take a generating-function approach, among other things enabling us to count 
the number of solutions satisfying various properties. Given an instance / of CSP, the corresponding 
generating function is the polynomial 

(2) £ **<"). 

ff-.V-^[k] 

More generally, if we want to keep track of several quantities simultaneously, say 1(a), J (a), ■ ■ ■ , 
we can consider a multivariate generating function ^2 a z^Wto^W • • • . Calculating the generating 
function in the obvious way, by running through all k' ' assignments, is clearly very slow, so it is 
desirable to have more efficient algorithms. 

In working with generating functions, we borrow some notions from statistical physics. We think 
of the score 1(a) as a "Hamiltonian" measuring the "energy" of a configuration a. Thus edge scores 
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correspond to "pair interactions" between adjacent sites, while vertex scores measure the effect of 
a "magnetic field". (From this perspective, the nullary score is just a constant that disappears 
after normalization.) The generating function is then the "partition function" for this model. (The 
partition function is often written in the form Y2 a ex P( — PI( a ))} where f3 is known as the inverse 
temperature, but substituting z for e~@ yields the partition function in polynomial form.) 

A crucial element of our approach is that the score 1(a) can be broken up as a sum of local 
interactions, and thus an expression such as z ^' can be expressed as a product of monomials 
corresponding to local interactions: 



z I(a) = z s 9 . "Q z s v (a(v)) . -Q . 



yS xy (ff(x),ff(y)) 
vEV xyeE 

In order to provide a framework for this approach, we introduce a generalized version of constraint 
satisfaction where the scores are polynomials (in some set of variables) instead of real numbers, 
and the score of an assignment is taken as a product rather than a sum. 

4.2. Polynomial CSP. An instance I of Polynomial CSP, with constraint graph G and domain [k], 
has the following ingredients: 

(1) a set of formal variables, and various polynomials over these variables: 

(2) a polynomial p$; 

(3) for each vertex v £ V and color i G [k], a polynomial p v {i); 

(4) for each edge xy G E and pair of colors i,j S [k], a polynomial p xy (i,j)- 

We refer to these three types of polynomial as, respectively, the nullary polynomial, the vertex 
polynomials, and the edge polynomials. (Reiterating from Section we use "polynomial" in a 
general sense, allowing negative and fractional powers.) We want only one polynomial for a given 
edge with given colors on its endpoints, so again we either take p xy (i, j) and p yx (j, i) to be equivalent 
or simply assume that x < y. 

Given an assignment a : V — > [k] , we define the score of a to be the polynomial 

(3) I{a) := p • Yl Pv(o-{v)) ■ JJ p xy (a(x),a{y)). 

v£V xy&E 

We then define the partition function Zj of I by 

(4) Z i= /Z / ( fJ )= E V%-\{pv{o-{v))-J[p xy {o-{x),o-{y)). 

a: V^[k] a: V->[k] v&V xy&E 

The partition function is the "solution" of a PCSP instance. In contrast with the max-sum formu- 
lation of (pQ), (J!]) is referred to as a sum-product formulation. The sense of the PCSP formulation 
will become clearer with examples in the next section. 

4.3. Ring CSP. The definition of I(cr) in equation (|3|), and for the most part our analysis in later 
sections, requires only addition and multiplication of scores. It therefore makes sense to work in a 
still more general context where the score functions are elements of an arbitrary ring. We define an 
instance I of RCSP over a ring R, with constraint graph G and domain [k], to have the following 
ingredients: 

(1) a ring element r^; 

(2) for each vertex v £V and color i 6 [fc], a ring element r v (i); 

(3) for each edge xy £ E and pair of colors i, j G [k], a ring element r xy (i,j). 

The score of an assignment a : V — > [k] is the ring element 

j o) := ^ • n r ^(°"( t ')) • n r zs/( CT ( x )' a (y))' 

veV xyGE 
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and the partition function Zj of I is 

Zi= Y. Up). 

a: V-*[k] 

Thus PCSPs are the special case of RCSPs where the ring R is a polynomial ring over the reals. 

5. Examples 

In this section we show how some standard problems can be written as PCSPs. First we show that 
for every CSP there is a naturally corresponding PCSP; then we illustrate how various problems, 
including some that are not CSPs, can be expressed as PCSPs. We will turn to algorithms in later 
sections, but for now bear in mind that a PCSP over a constraint graph G can be solved about as 
efficiently as a CSP over the same graph. 

5.1. Generating function of a simple CSP. 

Definition 1 (Generating function of an instance). Given an instance I of CSP on a constraint 
graph G = (V 7 , E) and variable domain [k], we can define a corresponding instance I* of PCSP with 
the same graph and variable domain, and polynomials 

PV> = z S9 
(Vu 6 V, Vi 6 [k]) Pv (i) = z 8 ^ 
(Vxy G E, Vt, j G [k]) Pxy (i,j) = z*»M. 
The connection between / and I* is given by the following simple observation. 

Lemma 2. Let I be an instance of CSP, and let I* be the corresponding PCSP instance. Then the 
partition function Zj* is the generating function (J2j) for the instance I. 

Proof. For any assignment a, we have 

I* (a) = z s * ■ Y[ z Sv(i) ■ ]~[ zSxy{i ' J) 

v£V xy£E 

— Z s 9+T,vev ^i^+T.xyeE s ^y( i 'i) 

It therefore follows that the partition function 

is the generating function for the original constraint satisfaction problem. □ 

Similar results are easily seen to hold for generating functions in more than one variable. 

5.2. Max Cut and Max Dicut. Max Cut provides a simple illustration of Definition Q] and 
Lemma [2j Let us first write Max Cut as a CSP; we will then construct the corresponding PCSP. 

Example 3 (Max Cut CSP). Given a graph G = (V, E), set k = 2 and define a CSP instance I by 

s = 
(Vu G V) s„(0) = s v (l) = 
(Vxy E E) s xy {0, 1) = s xy (l, 0) = 1 , s xy (0, 0) = s xy {l, 1) = 0. 
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With a' 1 ^) = {v. a(v) = i}, note that (Vo, V\) = (cr _1 (0),cr~ 1 (l)) is a partition of V, and 

(5) 1(a) = J2 l = <^i) 

xyeE: 
a(x)=0, a(y) = l 

is the size of the cut induced by a. The corresponding PCSP instance is obtained as in Definition [TJ 

Example 4 (Max Cut PCSP). Given a graph G = (V,E), we set k = 2 and define a PCSP 
instance I by 

P% = 1 
(VveV) Pv (0) = Pv (l) = 1 

(VxyeE) p xy (0,l) =p xy (l,0) = z , p xy (0,0) =p xy (l,l) = 1. 

(In all such cases, a 1 on the right hand side may be thought of as z°.) 

By Lemma [21 the partition function Zj is therefore the generating function for cuts: 

(6) Z J = J2 2 < ; iZ i , 

where q is the number of cuts of size i, and the factor 2 appears because each cut (Vb, Vi) also 
appears as (Vi, Vq). The size of a maximum cut is the degree of Zj, and the number of maximum 
cuts is half the leading coefficient. 

Note that the partition function © is the partition function of the Ising model with no external 
field (see below for a definition). Thus we have recovered the familiar fact that, up to a change of 
variables, the partition function of the Ising model is the generating function for cuts. 

We can also encode weighted instances of Max Cut with edge weights w : E — > K by modifying 
the third line of the definition above to 

p xy (0,l)= Pxy (l,0) = z w ^ VxyeE. 

Similarly, we can encode weighted Max Dicut (maximum directed cut) by setting 

P® = 1 
(WEV) p„(0)=p„(l) = l 

(Vxy £ E) p xy (0, 1) = z w ^ , p xy (l, 0) = z w ^ , p xy (0, 0) = p xy (l, 1) = 1, 

where w(xy) denotes the weight of the directed edge xy, and we define w(xy) = if there is no such 
edge. Max fc-Cut is encoded the same way, only with /c-valued variables in place of binary ones. 

5.3. The Ising model and Max Bisection. A slight generalization of the previous example 
allows us to handle the Ising model. 

Example 5 (Ising CSP model). The Ising model with edge weights J and external field h on a 
graph G is defined in terms of its Hamiltonian H . For an assignment a : V — > {0, 1} ; we define 

H(a) = JJ^ o(a(x),a(y)) + h £ a(v). 

xy&E d£V 

Here 5(a, b) is the delta function, returning if a = b, and 1 otherwise, J is the interaction 
strength, and h is the external magnetic field. In analogy with ©, taking Vi = <7 _1 (i), we may 
rewrite H as 

H(a) = Je(V ,V 1 ) + h\V 1 \. 

Note that H is an instance of (plain) CSP. 
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The partition function of the Ising model at inverse temperature (3 is 

JVi|,,e(Vo,Vi) 
V \aVx=V 



(7) Rising = E e ^ CT) = E «••'" 



where we have written it; = e - ^ and z = e~" , and the last sum is taken over ordered pairs (Vo, V\) 
that partition V. With this change of variables, the Ising partition function is easily expressed as 
partition function of a PCSP, this time over two variables. 

Example 6 (Ising PCSP). Define an Ising PCSP instance I by 

P% = 1 
(to G V) Pv (0) = 1 
(Mxy £ E) p v {l) = w , p xy (0, 1) = Pxy(l, 0) = z , p xy (0, 0) = p xy (l, 1) = 1. 

With our usual notation, 1(a) = w' x 'z e ( °' x \ and so the partition function for this PCSP is 
equal to (J7|). 

More generally, the Potts model can be written in a similar way. Less generally, the zero mag- 
netization (h = 0) anti-ferromagnetic (J = —1) special case of this Ising model has maximum cuts 
as its "ground states" (lowest-energy states): taking f3 — ► oo means that these states dominate the 
sum comprising Rising- To view it another way, the maximum degree in z of Z(w, z) is the size of 
a maximum cut in G. 

The Ising PCSP can also be used to handle Max Bisection. This is important, because most 
CSP algorithms (other than dynamic programming) can solve Max Cut, but cannot be applied to 
Max Bisection because there is no way to force them to generate a balanced cut. The Ising PCSP 
does this by tracking two variables at once: the bisections of G correspond to terms in Rising that 
have degree \n/2\ in w. (Each bisection is counted once if n is odd, twice if n is even.) Extracting 
these terms gives the generating function for bisections of G. 

The same partition function also yields a sparsest cut of the graph: sparsest cuts correspond to 
terms with the largest ratio of the power of z (number of cut edges) to the power of w (number of 
vertices in one partition). 

5.4. Max Independent Set and Max Clique. Maximum Independent Set (MIS) is easily ex- 
pressed as a CSP: 

Example 7 (MIS CSP). 



(to E V) s„(0) = , s„(l) = 1 
(toy G E) s xy (0, 0) = 8^(0, 1) = s xy (l, 0) = , s xy (l, 1) = -2. 

Maximum cliquqj cannot be modeled in the same way, because the clique constraint is enforced 
by non-edges, which are not an element of the model. Of course a maximum clique in G corresponds 
to a maximum independent set in its complement graph G, but for our purposes this can be very 
different, as we typically parametrize running time in terms of the number of edges, and a Max 
Clique instance on a sparse graph G with \E\ edges becomes an MIS instance on the dense graph 
G with (2) — \E\ edges. We discuss this further in Section [T3j However, as with Max Bisection, 
it is possible to model a sparse Max Clique instance as a PCSP (with the input graph G as its 
constraint graph) by introducing a second variable. 



We use clique in the sense of a complete subgraph, not the stronger definition as a maximal complete subgraph. 
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Example 8 (Max Clique PCSP). 

»0 = 1 
(V« G V) s„(0) = 1 , a„(l) = u> 

(Vxy E £/) Ssy (0, 0) = 5^(0, 1) = s^(l, 0) = 1 , s xy (l, 1) = z. 

An assignment <r has score 1(a) = w' Vl 'z e ^ ' Vl \ the power of w counting the number of vertices 
in the chosen set V\ = cr _1 (l), and the power of z counting the number of edges induced by that set. 

In the partition function, /c-cliques correspond to terms of the form w z^ 2 ' . Of course, independent 
sets of cardinality k correspond to terms with w k z°: the PCSP simultaneously counts maximum 
cliques and maximum independent sets (among other things). 

The same approach also counts maximum-weight cliques and/or independent sets, in a graph 
with vertex and/or edge weights, if we introduce a third formal variable w 1 and set the scores as 
before except for s v (l) = ww' weis (v ' and s xy (l,l) = zw' weis ^' . Cliques correspond to terms 
w k z^ 2 'w ,weig , and the maximum-weight clique the term with the largest power of w'. 

5.5. Judicious partitions. Similar techniques may be applied to various judicious partitioning 
problems [BS99, Sco05j, such as finding a cut of a graph which minimizes max{e(Vo), e(Vi)}. 

Example 9 (Judicious bipartition PCSP). 

S0 = 1 
(Vv£V) s„(0) = a„(l) = 1 

(Vxy E E) s xy (0, 1) = s xy (l, 0) = 1 , s xy (0, 0) = z\ , s xy (l, 1) = z 2 - 

A simple calculation shows that 1(a) = Zq ° z± 1 , where Vi = a~ 1 (i). Thus the problem of 
minimizing max{e(Vo), e(Vi)} (or, for instance, the problem of maximizing min{e(Vo), e(Vi)}) can 
be solved by examining the partition function. 

We might further ask for a bisection that minimizes max{e(Vo), e(Vi)}, and this is easily achieved 
by introducing a third variable: we set s v (0) = w and examine only terms of the partition function 
that have degree [|^l/2j in w. 

5.6. Simultaneous assignments. We can think of the foregoing example, of finding a balanced 
bisection minimizing max{e(Vo),e(Vi)}, as a case of having more than one CSP on a single set of 
variables. In other examples, we might have two instances of Max Sat that we wish to treat as 
a bi-criterion optimization problem; or a Max Sat instance to optimize subject to satisfaction of 
a Sat instance; or we might wish to find a partition of a vertex set yielding a large cut for two 
different graphs on the same vertices. All such problems are now straightforward: use a variable z 
to encode one problem and a variable w to encode the second. (Thus, the initial edge scores are of 

thp form ^firstscore secondscore ^ 

6. SPLIT-AND-LIST, MATRIX-MULTIPLICATION ALGORITHM 

The algorithmic approach introduced by Williams [Wil04j . which he calls "split and list", is 
characterized by its use of fast matrix multiplication. For binary 2-CSPs it runs in time 0*(2^ n ' 3 ) , 
where uj is the "matrix multiplication exponent" such that two n x n matrices can be multiplied 
together in time 0(n^). It also requires exponential space, 0(2 2n ' 3 ). The best known bound on 
the matrix multiplication constant is uj < 2.37. . ., from the celebrated algorithm of Coppersmith 
and Winograd [CW90J, which works over arbitrary rings. 

Koivisto [Koi06b] extended Williams' algorithm from CSP (with a max-sum formulation) to 
counting-CSP (with sum-product formulation), and Koivisto's version transfers immediately to the 
PCSP and RCSP context. While the sum-product formulation is common in statistical physics, 
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Koivisto's use of it appears to be one of the few besides the present work in the field of exact 
algorithms. For Williams' algorithm, the sum-product formulation is not only more general but 
also simpler. 

6.1. Algorithm. Koivisto's counting extension of Williams' algorithm is very much in the spirit 
of the original, and itself works in the PCSP and RCSP settings. Arbitrarily partition the variables 
V into 3 equal-sized sets, A, B and C. Explicitly list every possible assignment a : A h^ [k] of 
variables in A and similarly every assignment /3 for B and 7 for C. Explicitly construct Maa, a 
k' ' x k' ' diagonal matrix with elements 

M AA (a,a) :=Y[Pv{a(v)) ■ ][ p xy (a(x), a(y)), 

veA xy£En{AxA} 

and likewise Mbb and Mqc- Also explicitly construct Mab, a k' ' x k' B ' matrix with elements 

M AB (a,P) := Y\ Pxy(a{x),P(y)), 

xy£En{AxB} 

and likewise Mbc an d Mqa- Finally, explicitly calculate 

M = pqMaaMabMbbMbcMccMca, 

the multiplication by p$ being a scalar product and the others matrix products. Triples (a,/?, 7) 
are in one to one correspondence with assignments a : V 1— ► [k] , so the terms of the trace of M, 

tr(M) = ^^Y,P® M AA(a,a)MAB(a,(3)MBB{f3,f3)MBc{f3,j)Mcc(in)M C A(7,a), 
a /3 7 

are in one to one correspondence with the terms of I(cr), merely distinguishing whether each 
monadic score is associated with A, B, or C, and each dyadic score with Ax A, Ax B, etc. Thus, 
the partition function is the trace, 

Zi = tr(M). 

6.2. Complexity. Given that two n x n matrices over an arbitrary ring can be multiplied using 
O^n^) ring operations, we have established the following result. 

Theorem 10. Let R be a ring and let G be a graph with n vertices. Let I be any RCSP over R, 
with constraint graph G and domain [k] . Then the algorithm above calculates the partition function 
Zj with O (/c wn ' 3 ) ring operations. 

For PCSPs, as we discuss in Section [10], under modest conditions each ring operation takes time 
0*(1), that is, time polynomial in the input size. In this case, the algorithm will run in time and 
space 0*( y k ujn ' 3 Y (Space 0(k 2n ' s ) suffices if the fast matrix multiplication algorithm uses linear 
space.) 

6.3. A remark. This version of the algorithm is simpler than Williams' because matrix multipli- 
cation provides exactly the sum-of-products that RCSP calls for, not the maximum-of-sums needed 
for the usual 2-CSP. To get the latter, Williams combines the monadic scores with the dyadic ones 
(this would correspond to incorporating Maa into Mab)) "guesses" an optimal score s and how 
it is partitioned as s = sab + sbc + sca among A x B, B x C, and C x A; defines zero-one 
matrices so that M4s(a,/3) = 1 if the assignments a and (3 produce the desired value sab, and 
otherwise; and multiplies these binary matrices to find (and count) triangles, which correspond to 
assignments (a, /3, 7) yielding sab + sbc + sca- Williams' original algorithm must iterate over all 
guesses, of which there are 0(m 3 ) even for 0-1 score functions, while no such guessing is involved 
in the sum-product version. 
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7. A REDUCTIVE ALGORITHM 

The preponderance of algorithms for CSPs such as Max 2-Sat work by reduction: they reduce 
an input instance to one or more "smaller" instances in such a way that the solutions to the latter, 
found recursively, yield a solution to the former. The fastest polynomial-space algorithm for general 
Max 2-CSP is the reductive algorithm of [SS07], running in time 0*(fc 19m ' 100 ) for an instance with 
m 2-variable constraints and domain [k]. While we will not describe that paper's "Algorithm B" 
in detail, if an instance I has (a constraint graph with) a vertex of degree 0, 1, or 2, a Type 0, 

1, or 2 reduction (respectively) replaces I with an equivalent, smaller instance V . Otherwise, a 
Type 3 reduction replaces I with k smaller instances Iq, . . . , ijfe-i, the largest of whose solutions is 
the solution to /. If a Type 3 reduction results in a disconnected graph, the components are solved 
separately and summed. 

The algorithm extends to PCSP (and indeed RCSP), the key point being that the Type 0, 1, 

2, and 3 CSP reductions have PCSP (and RCSP) analogues. By working with polynomials rather 
than real numbers we are able to carry substantially more information: for instance, the old CSP 
reductions correspond to the new PCSP reductions with the polynomials truncated to their leading 
terms. 

7.1. Complexity. We exhibit the extended reductions below, where it will also be evident that 
each can be performed using 0(k s ) ring operations. The extension of [SSOTj's Algorithm B to 
RCSPs can then be seen to satisfy the following. 

Theorem 11. Let R be a ring and let G be a graph with n vertices and m edges. Let I be any 
RCSP over R, with constraint graph G and domain [k]. Then the extended Algorithm B calculates 
the partition function Zj in polynomial space and with o*(k 19m > 100 ) ring operations. 

7.2. The reductions. A Type reduction expresses the partition function of an RCSP as a 
product of partition functions of two smaller instances (or in an important special case, a single 
such instance) . Type 1 and 2 reductions each equate the generating function of an instance to that of 
an instance with one vertex less. Finally, a Type 3 reduction produces k instances, whose partition 
functions sum to the partition function of the original instance. In each case, once the reduction is 
written down, verifying its validity is just a matter of checking a straightforward identity. 

A word of intuition, deriving from the earlier CSP reductions, may be helpful (though some 
readers may prefer to go straight to the equations). A CSP 2-reduction was performed on a vertex 
v of degree 2, with neighbors u and w. The key observation was that in a maximization problem, 
the optimal assignment a(v) is a function of the assignments a(u) and a(w). That is, given any 
assignments o~(u) = i and a(w) = j, the optimal total s uw (i,j) of the scores of the vertex v, the 
edges uv and vw, and the edge uw (if present, and otherwise taken to be the score function) is 

(8) s uw (i,j) = m&x{s uw (i,j) + s uv (i,l) + s v (l) +s vw (l,j)}. 

By deleting v from the CSP instance and replacing the original score s uw with the score s uw , we 
obtain a smaller instance with the same maximum value. A 2-reduction in the RCSP context works 
the same way, except that the max-of-sums of (jHJ) is replaced by the sum-of-products of (fTOj) , 



Type Reduction. Suppose / is an RCSP instance whose constraint graph G is disconnected. 
Let V = V\ UV2 be a nontrivial partition such that e(Vi, V2) = 0. Let I\ and Li be the subinstances 
obtained by restriction to V\ and V2, except that we define the nullary scores by 

Jh) _ (I) (/a) _ , 

P® -P® , P® - -L- 



Polynomial CSP 11 

A straightforward calculation shows that, for any assignment a : V — > [k], we have 

H^) = n • n ^(°"( u )) • II Pxy( a ( x ^ a (y)) 

vGV xyGE 

P0 • II M°(. v )) ■ II Px V {cr{x),a(y))\\l-]\p v (a(v))- JJ p xy (a(x),a{y)) 
v€Vi xyeE{G[Vi]) J \ v&V 2 xy&E{G[V 2 \) 

-= Ii(cri)I 2 (<7 2 ), 

where cij denotes the restriction of a to Vi. It follows easily that 

Zj = Zj x Zi 2 . 

Thus in order to calculate Zj it suffices to calculate Zi x and Zi 2 . 

When one component of G is an isolated vertex v, with V = V \ v, one term becomes trivial: 

Z I = (iP0 • Z^ 6 [fc] Pw(*)) • ^J 2 - S ° in tllis CaSe 

Z/ = Zj, 

where I is the instance obtained from / by deleting v, defining 

m(?) =n ■ J2 Pv ^ 

ie[k] 

(a trivial calculation), and leaving all other scores unchanged. This reduces the computation of the 
partition function of / to that of an instance one vertex smaller. 

Type 1 Reduction. Suppose that / is an instance with constraint graph G, and v € V has 

degree 1. Let w be the neighbor of v. We shall replace I by an "equivalent" instance I (one with 
the same partition function) with constraint graph G\v. 
We define the instance I by giving w vertex scores 

fc-l 
(9) Pw{i) =Pw(i)^2(Pwv(i,j) -PvU))- 

3=0 

All other scores remain unchanged except for p v (*) and p vw (*i *), which are deleted along with v. 
To show that Zj = Zj, let a : V \ v — * [k] be any assignment and, for j € [k], extend a to 
a 3 : V — > [k] defined by a 3 \v) = j and a J \y\ v = a. Using §§§, we have 

I(v)=P»- II ^( a ( x )) ' II Pxy{o-{x),a{y)) 

xGV\v xy£E(G\v) 

P0- II Px(cr(x))- Y[ Pxy(o-(x),a(y))\ ■ ip w (cr(w))-^2Pwv(cr(w),j)-Pv(j) 

xeV\{v,w} xyeE(G\v) ) \ j=0 

fc-l 
^,P0 ■ Yl Pv(cr J (v)) ■ Yl Pxy(cr J (x),a J {y)) 

j=0 v£V xy&E 

fc-l 

j=0 
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and so 



Zl 


= E 

<r:V-»[fc] 


1(a) 

fc-i 




= E 


E J (^') 




cr:V\v-> 


[fc] 3=0 




= E 


7(a) 




er:F\v-> 


[k] 




= Z T 





Type 2 Reduction. Suppose that / is an instance with constraint graph G, and v € V has 
degree 2. Let it and w be the neighbors of v in G. We define an instance I with constraint graph G, 
which will have fewer vertices and edges than G. G is obtained from G by deleting v and adding 
an edge uw (if the edge is not already present). By analogy with the plain CSP 2-reduction from 
[SS07] and recapitulated above around (|5J), we define I by setting, for i,j £ [k], 

fc-i 

(10) Puw(hj) =Puw(i,j) -^2puv(iJ)Pv(l)Pvw(l,j), 

1=0 

where we take Puw(i,j) = 1 if edge uw was not previously present. All other scores remain un- 
changed except for p v (*), p V u(*, *) and p vw (*, *), which are deleted. 

To show that Zj = Zy, let a : V \ v — ► [k] be any assignment. As before, we write a 1 for the 
assignment with a l \y\ v = a and a (v) = I. Then 

T(<r)=PQ' n ^(°"( x )) ' II Pxy( a ( x )> a (y)) 

x£V\v xy&E{G) 

fe-1 

= P% II Px(cr(x)) ■ Y\ Pxy(<y(x),a(y)) ■^2p uv (a(u),l)p v (l)p vw (l,a(w)) 

xeV\v xy£E(G\v) 1=0 

fe-1 

= Y^ p 9 II ^( fj/ ( x ))' II Pxy(<y l (x),a\y)) 
1=0 x&V\v xy£E(G\v) 

fe-1 

1=0 

As with Type 1 reductions, this implies that Zj = Zj. 

Type 3 Reduction. Suppose that I is an instance with constraint graph G, and v € V has degree 
3 or more. Let U be the set of neighbors of v in G. We define k instances Iq, . . . ,Ik-i each with 
constraint graph G = G\v. For i £ [k], the ith instance Ii corresponds to the set of assignments 
where we take a(v) = i. 

We define miliary scores for Ii by setting 

P? =P0-Pv(i) 
and, for each neighbor u G £7 of v, and each j G [A;], vertex scores 

(^^(j) =Pu(j) -Puv(j,i)- 

All other scores remain unchanged from / except for p v (*) and p v * (*,*), which are deleted along 
with v. 
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For any assignment a : V\v — ► [k], and i € [k] we write (as usual) a 1 : V —>■ [k] for the assignment 
with cr l \v\v = & and cr l (v) = i. Then, writing W = V \ (v U T(v)), 

h{o)=h- II Px{cr{x)) ■ Y\ Pxy{<? l {x),o\y)) 
xeV\v xyGE(G\v) 

= \p<bPv(v) ■ ( n Px(v(x)) ■ n b«(^))pui>(o"(XM)]) • ( n p^^^)^^))) 

x&W u&U xy£E(G\v) 

= P$Y[Px{^(x))- Yl Pxy{^ i (x),a i (y)) 

xeV xy£E 

= V). 

Then 

cr:V^[fc] 

= E X><) 

cr:V\l)^[fc] i=0 

= E Ew 

«r:V\u-t-[fe] i=0 

= E E *(*) 

i=0 a:V\v-*[k] 
fc-1 

-IX 

i=0 

Thus the partition function for I is the sum of the partition functions for the ij. 

We can also write down another reduction, generalizing the Type reduction above, although 
we will not employ it here. 

Cut Reduction. Suppose that I is an instance with constraint graph G, and let Vq C V be a vertex 
cut in G. Let G\, . . . , G r be the components of G\Vq. We can calculate Zj by dividing assignments 
into classes depending on their restriction to Vq. Indeed, for each assignment cfq : Vq — ► [k], let us 
define 

I o=P<D ■ Y[ Pv(<ro(y)) Y\ Pxy{<ro(x),(To(y)), 

veVo xyeE(G[V }) 

and r instances I°° , . . . , I°° where I^° has score polynomials 

n = 1 

Vu £ Vb Pv{i) =Pv(i) ■ J\ Pwv{cr (w),i) 

w£Vo 

Vvw £ E[G \ Vb] p vw (i,j) = Pvw(i,j), 
where we take p wv (i,j) = 1 if wv E. 
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Then for any a : V — > [k], we have 

I(a)=I (o\V )-f[lf Vo (a\V i ) 

i=\ 

and so 

r 

Zj = ^2 h (o"o ) 17 Z^o . 

ffo:Vo-+[fc] i=1 

Thus the partition function Z/ is the sum of \Vq\ products: if \Vq\ is small, this provides an effective 
reduction to smaller cases. Note that this is what we have done in the Type 2 reduction above, 
where we also use the fact that, for a cutset of size two, the partition functions for the subinstance 
consisting of the single vertex v can be encoded using constraints on its neighbors. Unfortunately, 
this is not in general possible for larger cutsets. 

8. AN ALGORITHM FOR SPARSE SEMI-RANDOM INSTANCES 

An algorithm closely related to the CSP Algorithm B referenced above, and using the same 
reductions, solves sparse boolean CSP instances in expected linear time [SS06b]. Specifically, for 
A = A(n) > and a random graph G(n,c/n) with c < 1 + An -1 ' 3 (below the giant-component 
threshold or in its so-called scaling window), a boolean CSP instance I with constraint graph G is 
solved in expected time 0{n) exp(l + A 3 ). 

8.1. Complexity. Because the reductions extend to RCSP, it follows the algorithm extends im- 
mediately to boolean RCSP instances on such a constraint graph. 

Theorem 12. Let A = A(n) > 0. For c < 1 + An^ 1 ' 3 , let G € Q(n,c/n) be a random graph, and 
let I be any boolean RCSP over any ring R with constraint graph G. Then the algorithm above 
calculates the partition function Z[ with an expected 0(n) exp(l + A 3 ) ring operations. 

9. Dynamic programming for graphs of small treewidth 

Roughly, a tree decomposition of a graph G = (V, E) consists of a tree T each of whose vertices 
is associated with a "bag" B C V, such that every vertex of G appears in some bag, every edge of 
G has both endpoints in some bag, and the set of bags containing any given vertex v of G forms 
a subtree of T. The "width" of a tree decomposition is one less than the cardinality of the largest 
bag, and the treewidth of G is the minimum width of any tree decomposition. For our purposes we 
will assume that a tree decomposition is given, though for graphs of constant-bounded treewidth, 
a minimum-treewidth decomposition can be found in linear time [RS95, Rcc92| lBod96| . 

Efficient algorithms for various sorts of constraint satisfaction and related problems on graphs 
of small treewidth have been studied since at least the mid-1980s, with systematic approaches 
dating back at least to [DP871 IDP891 |A"P89| . A special issue of Discrete Applied Mathematics was 
devoted to this and related topics in 1994 [AHE94], and the field remains an extremely active area 
of research. 

A recent series of results on treewidth is Monien and Preis's proof that every cubic graph with 
m edges has bisection width at most (1/6 + o(l))m [MPOlj . Fomin and H0ie's use of this to show 
that the pathwidth of a cubic graph is bounded by (1/6 + o(l))m [FH06| . and the use of this to 
prove that any graph has treewidth at most (13/75 + o(l))m [SS07, KMRR09J. 

Focusing on two problems of particular interest to us, as a key part of an approximation result 
in Jansen, Karpinski, Lingas and Seidel's [JKLS05] . it was shown that given a graph G and a tree 
decomposition T of width 6—1, an exact maximum (or minimum) bisection of G can be found in 
time and space O* (2 fe ) , using dynamic programming on the tree decomposition. To find a maximum 
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clique in time 0*(2 fc ) is even easier, and may be done in polynomial space, using the well-known 
fact that any clique of G is necessarily contained in a single bag of T. 

Here, we generalize these results for bisection and clique to the entire class RCSP. While the 
extension is reasonably straightforward, the details are confusing enough that it is valuable to have 
the general result available in this neatly packaged form, and not have to apply the approach (or 
wonder if it can be applied) to a particular problem. 



9.1. Complexity. 

Theorem 13. Let G be a graph with n vertices, m edges, and a tree decomposition T of width b— 1. 
Let L be any RCSP instance over G. Given L and T, the partition function Zj can be calculated in 
space 0(k b ~ 1 ) and with 0*(k b ) ring operations. 

We sketch the proof before detailing it. We express an assignment score as a product of "bag 
scores" , fs for a bag B. In computing the partition function, we reduce on any leaf bag B\ of T and 
its parent B2 , in a way closely related to the Type 1 reductions of Section [7) variables appearing 
only in B\ can be "integrated out" to yield a function f' B on a smaller set of variables. By the 
definition of a tree decomposition, that smaller set is a subset of the variables of B2, so we can 
absorb f' B into fs 2 by defining f' B = fs 2 • f' B ■ Deleting B\ from T, and forgetting fs 1 , completes 
the reduction. We now present this more formally. 

Proof. We first show how to express the score of an assignment as a product of "bag scores" . Let 
B be the collection of all bags. While a vertex is typically found in several bags, associate each 
vertex v G V(G) with just one bag that contains it; similarly, associate each edge e G E(G) with 
some bag containing both its endpoints. For a bag B, let V(B) be the set of associated vertices 
and E(B) the set of associated edges, so that ^SbgbV(B) = V(G) and ^SbgbE(B) = E(G). Define 
each bag score as the product of its associated vertex and edge scores: 

(11) /b(o-|b) = n Pv{o-{v)) ■ Yl Pxy{o-{x),a(y)). 

veV(B) xy€E(B) 

(Note that for an edge xy G E(B), the endpoints x and y need not be in V(B), but they must be 
in B itself, and thus /b(c|b) is well defined, depending only on assignments of vertices in B.) It is 
clear that for the original PCSP instance I, 

Zi := Yl p ' 11^^)) ' IT Pxy( a ( x )> a (y)) 

a: V-*[k] veV xyeE 

(12) = e » n /*(*!*)• 

a: V-+[k] BeB 

We now show a reduction that will preserve the form of (I12p . but with one bag fewer. After 
reduction, the bag scores themselves will no longer depend simply on vertex and edge scores as 
in (jlip . which is why we work with (|12D instead. 

Let B\ be a leaf bag and B2 its parent. Let V = B\\B2- By the nature of a tree decomposition, 
V' is the set of vertices found exclusively in B\. Let V" = V\V. Correspondingly, we will use a' 
and a" for assignments from these sets to [k], and (a', a") for a complete assignment from V to \k\. 
In the following rewriting of (I12j) . lines (|14h and (|15j) are implicit definitions of the functions f B 
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and f' B% . 

(13) Zi= Yl P0'( II M^'lfi) •/B 2 (oiB 2 )-/ Bl (CT| Bl ) 

<r: V^[fe] \BSB\{-Bi,-B 2 } / 

X] P0' II /b(^"|b) •/b 2 (^"|b 2 )- X /Bi«VV)|Bi) 

cr":y"-*[fc] \BeB\{Bi,B 2 } / ff' : V'->[k] 

(14) = E «•( II /BKb))-/B 2 (^k)-/ Bl K , |B 1 nB 2 ) 

a":V"-*[k] \BeB\{Bi,B 2 } / 

(15) = E «•( II fB^"\ B )\ ■ f B2 (a"\ B2 ). 

cr":V"-+[k] \BGB\{Bi,B 2 } / 

This reduces the number of bags by one, as desired. Note that the definition implicit in (|14p . that 
for any a" : V" —>■ [k] , 

/Bi((°"")lBinB 2 ) := E fBi({°" ,v')\b x ), 

a': V'-*[fc] 

is well defined because the right-hand side depends only on values of (a" , a') on the set B\ n -E>2- 
Likewise, the definition implicit in (|15|). that for any u": V" — » [fc], 

/b 2 (°""Ib 2 ) := /b 2 (o""|b 2 ) • /^((OlfiinBa), 

is well defined because the right-hand side depends only on values of (a", a') on the set B^- 

In going from ([15]) to (fT5j) we reduced the decomposition tree size by 1. To do so we represented 
the functions f' B and f' B explicitly, writing down respectively k' ' and k ' ' • a '' score values. The 
reduction does not change the size of any bag, so if initially each bag has size at most 6 — 1, each 
reduction takes space 0(k b ^ 1 ) and requires 0*(£; 6-1 ) ring operations (each bag evaluation may 
require evaluating polynomially many vertex and edge scores). 

The algorithm really does require extra space 0(k ) to represent the updated score function 
f' B , because f' B incorporates fs 1 "integrated out" over a' : V' — > [k] and can no longer be viewed 
as coming from a bag of vertex and edge scores. We conclude that altogether the algorithm takes 
space 0*{k ) and uses 0*(k ) ring operations. □ 

10. Controlling the ring-operation complexity 

In this section, we restrict to Polynomial CSPs, and consider the complexity of performing the 
ring operations in our algorithms. In general, PCSP instances can be intractable. For example, 
consider a boolean ([k] = {0, 1}) PCSP instance / where all dyadic scores are 1, and the monadic 
scores are p v (0) = 1, p v (l) = z Tv , with mutually incommensurable real values r v . Then each of the 
2 n variable assignments corresponds to a different monomial in the partition function Z{I). Since 
the output is of size 0(2 n ), there is no hope of a time- or space-efficient algorithm. 

However, some modest restrictions suffice to control the size of the elements of the polynomial 
ring with which we are working (including the output itself) , and thus the time- and space-efficiency 
of the various algorithms. 

10.1. Polynomially bounded instances. The simplest well-behaved case is when an instance is 
over a single variable, and all score polynomials are of low degree, with small coefficients. This is 
easily formalized and generalized. 

Definition 14 (Polynomially bounded). A family J- of instances of PCSP is polynomially bounded 
if for an instance I G J- having n variables and m clauses, with respect to m + n: 
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• the number of formal variables is 0(1), 

• all powers are integers with absolute value 0*(1), and 

• coefficients are integers of length 0*(1), or sums and products of coefficients are deemed to 
be elementary operations. 

For convenience we will speak of "a polynomially bounded instance" . 

Lemma 15. In any of the four algorithms described, over a fixed domain [k], solving a polynomially 
bounded instance I with n variables and m clauses, each ring operation can be performed in time 
and space 0*(1). 

Proof. Say that there are c variables, each with maximum degree A. As a sum of products, the 
partition function itself has maximum degree (in each variable) at most (m + n + 1)A. This 
is also true of any polynomial that arises in the course of the computation. This can be verified 
directly by looking at the details of each algorithm. Alternatively, note that the algorithms compute 
"obliviously" on the polynomials (the operations performed depend on the constraint graph but 
not on the polynomials), the algorithms never subtract, and thus if on any input an algorithm 
ever produced terms of higher degree than the degree bound on the partition function, it would 

certainly do so when each input score polynomial is 1 + z H 1- z (or the equivalent multivariate 

polynomial), and (in the absence of any negative coefficients) such terms would survive to appear 
in the partition function, a contradiction. 

Since each intermediate polynomial has maximum degree at most (m + n + 1)A, it has at most 
((m + n + 1)A) C = 0*(1) terms, and two such polynomials can be multiplied with 0*(1) integer 
operations. 

If we wish to further break down the time as a function of the lengths of the integer coefficients, 
each arises as a sum of at most k m+n+1 products of at most m + n + 1 input coefficients, and 
therefore has length at most (m + n + 1) log k ■ (m + n + 1) = 0*(1) times that of the longest input 
coefficient. Thus, adding and multiplying coefficients can also be done in time 0*(1). □ 

10.2. Pruned instances. An alternative for a PCSP in a single variable z is to allow arbitrary 
(real) powers in the score polynomials, but to avoid the blowup in our motivating example by 
demanding not the full partition function but just its leading term (the one with the highest power 
of z). 

Definition 16 (z-prunable). An instance of PCSP with n variables and m clauses (more precisely 
a family of instances) is z-prunable if, with respect to m + n: 

• the number of formal variables is 0(1), 

• z may have arbitrary real powers, 

• the remaining variables W\,W2, ■ ■ ■ have integer powers with absolute value no more than 
0*(1), and 

• coefficients are nonnegative integers of length 0*(1), or coefficients are nonnegative reals 
and sums and products of coefficients are deemed to be elementary operations, 

where the O(-) notation is with respect to m + n. 

Note that a PCSP arising as the generating function of a CSP (see Section 15. ip is always z- 
prunable: there is just a single formal variable z, and all coefficients are 1. 

Pruned polynomial. Given a polynomial p in one variable z, we define the pruned polynomial 
(p) z to be the polynomial obtained by removing all but the leading term. If p is a polynomial in 
variables z,wi,W2, ■ ■ ., we obtain (p) z by throwing away all terms T such that there is a term of 
the form cTz 1 , where c > is real, and i > 0. For instance, 

(2z + 2>Z + 700 + ZW\ + Z W\ + ZW2 + Z W\W2) Z = 2z + Z W\ + ZW2 + Z W±W2- 
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Pruning a partition function often preserves all the information of interest. Consider for example 
a PCSP I arising from a simple maximization CSP, say weighted Max Cut. In the partition function 
Zj, the leading term's power is the weight of a largest cut, and its coefficient the number of such 
cuts. (For this informal discussion, we will simply multiply-count cuts that are symmetric to one 
another.) With real edge weights, Zj can have exponentially many terms, but the weight of a 
largest cut, and the number of such cuts, is preserved in the pruned partition function (Zj) z , which 
consists solely of that leading term. 

Somewhat more generally, consider a CSP where we wish to maximize one (real-valued) parame- 
ter for one or more values of several other (integer- valued) parameters. For example, the maximum 
bisection of an edge-weighted graph means maximizing the weight of cut edges, when the number 
of vertices in one partition is held to n/2. Terms in the partition function Zi of the correspond- 
ing bivariate PCSP have real powers of the edge-weighing variable z, and integral powers of the 
variable w that counts vertices in partition 1. Again (by definition) pruning to (Zj) z preserves the 
maximum power of z and the corresponding coefficient for any fixed power of w, so from (Zj) z we 
may read off the weight of a maximum bisection and the number of such bisections, and indeed the 
weight and cardinality of maximum cuts of any specified sizes (say with exactly n/3 vertices in one 
partition) . 

Pruning in the algorithms. The key point about pruning is that for polynomials p and q (in any 
set of variables) if coefficients are nonnegative then (p + q) z = (p z + q z ) z and (pq) z = ((p)z(q)z)z- 

With either of the reductive algorithms described, the pruned partition function can be obtained 
efficiently, first pruning the input instance, then pruning as we go. (Correctness of this can be seen 
inductively, working backwards from the last reduction.) It is a trivial observation that for each 
reduction R (of Type 0-3), and any instance /, {R{I)) Z = (R(I Z )) Z . In particular, if we perform a 
full sequence of reductions and z-prune at every stage, we will end up with (Zj) z . 

Pruning can also be done for tree decomposition-based dynamic-programming algorithms. Refer- 
ring to the proof of Theorem [131 we simply note that the pruned partition function can be obtained 
from pruned versions of f Bl , f' Bl , /b 2 , and f' B2 . 

This establishes the following lemma. 

Lemma 17. Given a z-prunable PCSP instance I with n variables and m clauses over a fixed 
domain [k] , the pruned partition function can be calculated by the reductive or dynamic programming 
algorithms (see Theorems \11[ [7H and \ 13\) with each ring operation taking time and space 0*(1). 

It is not likely that the PCSP extension of Williams' algorithm can accommodate real-valued 
powers, even with pruning. One problem is the use of fast matrix multiplication as a "black box" : 
it is unlikely that we can prune within the multiplication algorithm. Another piece of evidence 
against being able to accommodate real-valued PCSP powers is that in the CSP setting, Williams 
was unable to accommodate real- valued scores (the analogue of the PCSP's powers), and a solution 
for PCSP would imply one for CSP. Williams noted that one could work around real- valued scores 
by approximation methods, for example multiplying all scores by a large constant and then replacing 
each with the nearest integer, and this can also be done in the PCSP setting. 

11. Summary of PCSP algorithmic results 

Combining the results in Sections EHHl phrased in terms of the number of ring operations, with 
the bounds in Section [10] on the complexity of these operations for PCSPs, gives the following 
general conclusions. Some particular results are presented in Section [T3l 

Theorem 18. The PCSP extension of Williams split- and-list algorithm [Wil04] solves any poly- 
nomial^ bounded PCSP instance with n variables over domain [k] in time and space 0*(fe ajn ' 3 ), 
where lo is the matrix-multiplication exponent for the ring of polynomials over reals. 
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Parametrizing by vertices, this exponential-space algorithm is the only known algorithm faster 
than 0*(k n ) for PCSP, just as it is for CSP. In particular, it is the only algorithm efficient for 
dense PCSP instances. 

Theorem 19. The PCSP extension of of Scott and Sorbin's reductive algorithm [SS07] solves any 
polynomially bounded PCSP instance (or finds the pruned partition function of any prunable PCSP 
instance) with n variables, m clauses, and over domain [k], in time 0*(k l9m > 100 ) and space 0*(1). 

This algorithm is suitable for sparse instances (if m > (100/19)n then the naive 0(k n ) algorithm 
is better). It is not as fast as dynamic programming (see Theoreml2ip. but unlike that and Williams' 
algorithm it runs in polynomial space, and is potentially practical. 

Theorem 20. When G = G(n,c/n) is an Erdos-Renyi random graph with c < 1 + An -1 ' 3 , an 
extension of Scott and Sorbin's expected-linear-time CSP algorithm [SS06b] solves any boolean PCSP 
instance with constraint graph G in expected time 0*(l)exp(l + A 3 ) and in space 0*(1). 

This polynomial-space, polynomial-expected-time algorithm is of course the most efficient for 
sparse semi-random instances (below the giant-component threshold or in its scaling window). 

Theorem 21. Tree decomposition-based dynamic programming can solve any polynomially bounded 
PCSP instance (or find the pruned partition function of any prunable PCSP instance) with n 
variables, m clauses, over domain [k], and having treewidth b — 1, in time and space 0*[k b ) or 
(since b-l< (13/75 + o{l))m, from [SS071 [KMRR09] ) time and space 0*(k ( - 13 / 75+0 ^ m ) . 

This exponential-space algorithm is suitable for instances of small treewidth, including sparse 
instances with m < (75/13)n. 

12. Constructing and sampling solutions 

Wherever we can compute a CSP's maximum value, we can also produce a corresponding as- 
signment; and wherever we can count assignments producing a given value, we can also do exact 
random sampling from these assignments. The method is standard, and we illustrate with sam- 
pling. We construct our assignment one variable at a time, starting from the empty assignment. 
Given a partial assignment o"o : Vq — > [k] , and a vertex v Iq , we calculate the partition functions 

(16) Z l;*o(i) = E J ( ff )> 

<f- °-|vq=o"o, cr(v)=i 

and use these to determine the conditional distribution of a{v) given that a\j = oq. 

This enables us to sample from a variety of distributions. For instance, we get the following 
result. 

Theorem 22. Let G be a graph with m edges. Then in time Q*(2 19m ' 100 ^ and space 0*(1) we can 
sample uniformly at random from the following distributions: 

• maximum cuts, maximum bisections, minimum bisections 

• maximum independent sets, cliques of maximal size 

• independent sets of any fixed size, cliques of any fixed size. 

In the same time, we can sample from the equilibrium (Gibbs) distribution of the Ising model with 
any fixed interaction strength and external magnetic field. 

A similar result holds for edge- and/or vertex-weighted graphs, except that we sample at random 
only from optimal assignments. Many other problems can be expressed in this framework. For 
example, we can count and sample proper A;-colorings in time 0*(k 19m ' 100 ) and space 0*(1), or in 
time and space 0*(k^' 75+0 ^ m ). 
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If we wish to exhibit just one optimal assignment deterministically, we can do so by a small 
modification of the sampling approach above: at each step, assign the next vertex v the smallest i 
such that (Jq(i) can be extended to an optimal assignment (this can be read off from Zj. a u\ given 
by (USD). 

13. Applications and Conclusions 

The PCSP / RCSP framework encompasses many problems that are not in CSP, yet can be 
solved by extensions of any general CSP algorithm we know of, with essentially equal efficiency. 
For several problems, this yields the best algorithms known. 

13.1. Graph bisection. Jansen, Karpinski, Lingas and Seidcl [JKLS05J showed how to use dy- 
namic programming based on tree decomposition to find a maximum or minimum bisection of a 
graph. Combined with the bounds from [SS07] IKMRR09] on the treewidth of a graph with m 
edges, this gave an algorithm with a time and space bound of (9*(2^ 13 ' 75 +°( 1 )) rn ). 

Until now, no polynomial-space algorithm was known for this problem other than the naive 
enumeration running in time 0*(2 n ). Once bisection is phrased as a PCSP, our results (Theorem ll9p 
immediately imply that in time O*(2 19rn ' 10 °) and polynomial space, we can solve maximum and 
minimum bisection, count maximum and minimum bisections, and indeed count bisections of all 
sizes. We can do this also in weighted bisection models, as long as they are polynomially bounded 
or z-prunable. 

Our results (Theorem [18]) also mean that dense instances of bisection can be solved by Williams' 
algorithm, in time and space 0*(2 wn ' 3 ), again yielding the full generating function of cuts. 

13.2. Cliques and Independent Sets. Maximum Independent Set is in Max 2-CSP and has 
been studied extensively, but, surpisingly, PCSP can offer an advantage here. For simple MIS, 
for any graph with average vertex degree d, all four of our general-purpose PCSP algorithms are 
dominated either by [BEP08| (for average degree < 3, time o*(2 01331n )) or by |FGK06j (time 
0*(2°' 288n ), otherwise). However, for weighted MIS, counting MIS, or weighted counting MIS, the 
fastest algorithm we are aware of is the vertex-parametrized 0*(2°- 329On ) algorithm of Fiirer and 
Kasiviswanathan [FKQ5], based on an improved analysis of an algorithm of Dahllof, Jonsson, and 
Wahlstrom |D JW05] . For any of these problems the (3*(2 19m / 100 ) reductive algorithm of Theo- 
rem [19] is the fastest polynomial-space algorithm known if the average degree is below about 3.46, 
and the 0*(2' ' 75 +°( 1 )) m ) exponential-space dynamic-programming algorithm of Theorem 1211 the 
fastest algorithm known if the average degree is below about 3.79. 

We noted earlier that Maximum Clique is not in the class 2-CSP, because constraints are given 
by non-edges, not edges, but that it is in the class PCSP. This is nice, but does not currently offer 
any algorithmic advantage. If G has average degree d = o(n), the following simple algorithm has 
running time O* (2°^ n ' ) , dominating all other algorithms under consideration here. For any k of our 
choosing, any clique either is contained in the < n/k-oider subgraph induced by vertices of degree 
> kd, which is checkable in time 0*(2 n ' ), or contains a vertex of degree < kd, which is checkable 

in time O* (2 fcrf ) . Taking k = y/n/d gives running time O* ( 2^ nd ) . If d is not o(n) then certainly 

d = uj(1). This rules out edge-parametrized algorithms, leaving us only Williams' algorithm, which 
is less efficient than applying a vertex-parametrized MIS algorithm such as that of [FK05J to the 
complement graph G. 

13.3. Ising partition function. The Ising partition function, a canonical object in statistical 
physics, is a PCSP, and can be solved by any of the algorithms presented here. We know of no 
other general algorithm to find the partition function other than naive enumeration over all 2™ 
configurations. 
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13.4. Potts and other g-state models. The Potts (g-coloring) partition function can trivially 
be evaluated in time 0*{q n ), but it is by no means obvious that one can remove the dependence 
on q. A breakthrough result of Bjorklund, Husfeldt, Kaski, and Koivisto [BHKK08] (see also 
[BH06bl lKoi06al IBHKl rBHKK071 lBH06al IBH08] ) shows how to evaluate the Potts partition function 
(and therefore also the Tutte polynomial) using the Inclusion-Exclusion method, in time 0*(2 n ) 
in exponential space, or time 0*(3 n ) in polynomial space. 

Our PCSP model includes the Potts model and more general models where the "energy" is not 
simply a function of equality or inequality of neighboring colors. Our algorithms unfortunately 
do have have exponential dependence on q, but with the greater generality of PCSP this may 
be unavoidable: as noted in [BHKK08], Traxler [TraDS] shows that if the g-state Potts model is 
generalized even to 2-CSPs on variables with q states, exponential dependence on q cannot be 
avoided, at least in the vertex-parametrized case, assuming the Exponential Time Hypothesis. 

13.5. Conclusions. As we have shown, the PCSP formulation is clean and offers quantifiable 
advantages for a wide range of problems. However, the greatest benefit is its broad applicability. 
For example, exponential time algorithms have not previously been considered for problems such 
as judicious partitioning, and for virtually any form of this problem such algorithms now follow 
instantly from its membership in PCSP. Moreover, when solving optimization problems coded into 
PCSP form, we automatically get solutions to the corresponding counting problems. 

Finally, we note that algebraic formulations appear to be playing a newly important role in the 
development of exponential-time algorithms for CSPs and related problems. Our work along these 
lines began with [SS06aJ, and we have already discussed Koivisto's sum-of-products variation on 
Williams' split-and-list algorithm [Koi06b]. The methods leading to the 0*(2 n ) Tutte polynomial 
computation [BHKK08] work in a fairly general algebraic setting, and a new algorithm of Williams 
[Wil08j that finds a fc-path in an order-n graph in time O* (2 fc ) works in an algebra carefully chosen 
for properties including making walks other than simple paths cancel themselves out. PCSPs should 
be considered one piece of this new algebraic tool kit. 
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